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TN T RR SPECIFICATION : 

Please amend the paragraph beginning at page 6, line 8, as follows. 

In section (3)(d), the development of a method for computing acoustic 
confusability between models is begun; in this section, confusability of densities is also 
5 discussedr This development proceeds by the following steps. In section (3)(e), the 
interaction between paths and densities is shown and discussed. In section (3)(f), an 
algorithm for computing confusability of hidden Markov models is disclosed. This 
method, as shown in section (3)(f), comprises the following: (i) constructing a product 
machine; (ii) defining the transition probabilities and synthetic likelihoods of the arcs and 
10 states of this product machine; and (iii) determining a probability flow matrix and 
relating the result of a certain matrix computation to acoustic confusability. 

Please amend the paragraph beginning at page 13, line 26, as follows. 
Thus, under these assumptions, the value p(a(w) \ h x) is now the quantity 
15 that needs to be determined, in either the continuous or discrete cases. This disclosure 
disclosu or will treat the continuous case in more detail later. 

Please amend the paragraph beginning at page 14, line 17, as follows, - 
If there were only one single model K'l*) for word x (that is, one single 

20 model for evaluating acoustic event probabilities, on the assumption that x wa$ the word 
being pronounced), then pW™) I -v) would be the probability that this model assigns to 
the observation a(w). But, in general, a given word x has many pronunciations. For 
instance, the word 4< the" may be pronounced 4 TH Uff* or "TH IY f " wherein the "TH, ?> 
"UH," "TH " and "IY" are phonemes (also called "phones")- Phonemes are small, 

25 indivisible acoustic elements of speech; here UH represents a schwa sound, whereas IY 
represents a long e vowel. There are about 51. phonemes in English, although some 
speech recognition systems might have more or less phonemes. These pronunciations are 
referred to as lexemes or baseforms, and the following may be written: 

30 x=UHx\ l 2 (x ),..., /"*(*)}, (10) 
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to indicate that a word ^.admits of multiple pronunciations (•*). ^C*) and so on. Here n x 
is the number of distinct pronunciations recognized for x and each W is one lexeme. 
Carrying this notation a little further, it is possible to write 1*)e * for an arbitrary lexeme 
l(x) associated with the word x, and 2 ^for a sum in which l(x) varies over the lexeme 
5 set for x. 

Please amend the paragraph beginning at page 60, line 10, as follows. 
Turning now to FIG. 9, this figure shows a probability flow matrix +090 
that is populated using the product machine 830 of FIG. 8. Also shown in FIG. 9 is a 

10 column +9£9 930 that corresponds to the leftmost column of <('- M ) I '). Probability 
flow matrix +99© contains probability flow matrix 600, which was shown FIG. 6. 
Additionally, the new state w< of the synthesizer model 810 of FIG. 8 causes entries 1001 
through 1010 to be populated with probabilities. Determination of these types of 
probabilities has been previously discussed in reference to FIG. 6. From FIG. 9 and the 

15 previous discussion on Computational Caching, it can be seen that n through r 9 will 
already be calculated when probability flow matrix 600 is used to determine acoustic 
confusability for synthesizer model 410 and evaluation model 420. Therefore, these may 
he held and reused when determining acoustic confusability from probability flow matrix 
4009, which derives from synthesizer model 810 and evaluation model 420. This is a 

20 tremendous time savings, as r, 0 through m are the only values that need to be determined 
when probability flow matrix +099 is used to determine acoustic confusability. For 
instance, it could be that synthesizer model 4 10 is the synthesizer model for "similar" and 
synthesizer model 810 is the synthesizer model for "similarity." The results n through r 9 
may be held and reused during the probability flow matrix calculations for "similarity." 

25 Likewise, the synthesizer model 810 could be the synthesizer model for "similar." The 
results for "similar" could be reused when computing acoustic confusability for 
"similarity." Note that the ordering of the states of the models will affect whether cachin g 
can be used for prefixes, suffixes or both. 
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